23:35
2026-06-10
lesswrong.com
artificial-intelligence
Thoughts on Claude Fable's silent safeguards
Anthropic released Claude Fable 5, its most capable Mythos-class model, with new safeguards that silently limit the model's effectiveness for requests related to frontier LLM development without notifβ¦